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Description 



[0001] The present invention relates to a lithographic projection apparatus comprising: 

5 o a radiation system for supplying a projection beam of radiation; 

• a support structure for supporting patterning means, the patterning means serving to pattern the projection beam 
according to a desired pattern; 

- a substrate table for holding a substrate; and 

a projection system for projecting the patterned beam onto a target portion of the substrate. 

10 

[0002] The term patterning means* as here employed should be broadly interpreted as referring to means that can 
be used to endow an incoming radiation beam with a patterned cross-section, corresponding to a pattern that is to be 
created in a target portion of the substrate; the term c l!ght valve 3 can also be used in this context. Generally, the said 
pattern will correspond to a particular functional layer in a device being created in the target portion, such as an Inte- 
rs grated circuit or other device (see below). Examples of such patterning means include: 

- A mask. The concept of a mask is well known in lithography, and it includes mask types such as binary, alternating 
phase-shift, and attenuated phase-shift, as well as various hybrid mask types. Placement of such a mask in the 
radiation beam causes selective transmission (in the case of a transmissive mask) or reflection (in the case of a 
reflective mask) of the radiation impingfng on the mask, according to the pattern on the mask. In the case of a 
mask, the support structure will generally be a mask table, which ensures that the mask can be held at a desired 
position in the incoming radiation beam, and that it can be moved relative to the beam if so desired. 
A programmable mirror array. An example of such a device is a matrix-addressable surface having a viscoelasttc 
control layer and a reflective surface. The basic principle behind such an apparatus is that (for example) addressed 
areas of the reflective surface reflect incident light as diffracted light, whereas unaddressed areas reflect incident 
light as undiffracted light. Using an appropriate filter, the said undiffracted light can be filtered out of the reflected 
beam, leaving only the diffracted light behind; in this manner, the beam becomes patterned according to the ad- 
dressing pattern of the matrix-addressable surface. The required matrix addressing can be performed using suit- 
able electronic means. More information on such mirror arrays can be gleaned, for example, from United States 
Patents US 5,296,891 and US 5,523,193, which are incorporated herein by reference. In the case of a program- 
mable mirror array, the said support structure may be embodied as a frame or table, for example, which may be 
fixed or movable as required. 

- A programmable LCD array. An example of such a construction is given in United States Patent US 5,229,872, 
which is incorporated herein by reference. As above, the support structure In this case may be embodied as a 
frame or table, for example, which may be fixed or movable as required. 

For purposes of simplicity, the rest of this text may, at certain locations, specifically direct itself to examples involving 
a mask and mask table; however, the general principles discussed in such instances should be seen in the broader 
context of the patterning means as hereabove set forth. 

[0003] Lithographic projection apparatus can be used, for example, in the manufacture of integrated circuits (ICs). 
In such a case, the patterning means may generate a circuit pattern corresponding to an individual layer of the IC, and 
this pattern can be imaged onto a target portion (e.g. comprising one or more dies) on a substrate (silicon wafer) that 
has been coated with a layer of radiation-sensitive material (resist). In general, a single wafer will contain a whole 
network of adjacent target portions that are successively irradiated via the projection system, one at a time. In current 
apparatus, employing patterning by a mask on a mask table, a distinction can be made between two different types of 
machine. In one type of lithographic projection apparatus, each target portion is irradiated by exposing the entire mask 
pattern onto the target portion in one go; such an apparatus is commonly referred to as a wafer stepper. In an alternative 
apparatus — commonly referred to as a step-and-scan apparatus — each target portion is irradiated by progressively 
scanning the mask pattern under the projection beam in a given reference direction (the "scanning" direction) while 
synchronously scanning the substrate table parallel or anti-parallel to this direction; since, in general, the projection 
system will have a magnification factor M (generally < 1), the speed V at which the substrate table is scanned will be 
a factor M times that at which the mask table is scanned. More information with regard to lithographic devices as here 
described can be gleaned, for example, from US 6,046,792, incorporated herein by reference. 
[0004] In a manufacturing process using a lithographic projection apparatus, a pattern (e.g. in a mask) is imaged 
onto a substrate that is at least partially covered by a layer of radiation-sensitive material (resist). Prior to this imaging 
step, the substrate may undergo various procedures, such as priming, resist coating and a soft bake. After exposure, 
the substrate may be subjected to other procedures, such as a post-exposure bake (PEB), development, a hard bake 
and measurement/inspection of the imaged features. This array of procedures is used as a basis to pattern an individual 



23 



30 



2 



EP1 2S5 104 A1 



layer of a device, e.g. an :c. Such a patterned layer may then undergo various processes such as etching, lon-imp!an- 
tation (doping), metallization, oxidation, chemo-mechanical polishing, etc., all intended to finish off an individual layer. 
If several layers are required, then the whole procedure, or a variant thereof, will have to be repeated for each new 
layer. Eventually, an array of devices wiil be present on the substrate (wafer). These devices are then separated from 
one another by a technique such as dicing or sawing, whence the individual devices can be mounted on a carrier, 
connected to pins, etc. Further information regarding such processes can be obtained, for example, from the book 
"Microchip Fabrication: A Practical Guide to Semiconductor Processing 0 , Third Edition, by Peter van Zant, McGraw 
Hill Publishing Co., 1997, ISBN 0-07-087250-4, incorporated herein by reference. 

[0005] For the sake of simplicity, the projection system may hereinafter be referred to as the n lens y ; however, this 
term should be broadly interpreted as encompassing various types of projection system, including refractive optics, 
reflective optics, and catadioptrlc systems, for example. The radiation system may also include components operating 
according to any of these design types for directing, shaping or controlling the projection beam of radiation, and such 
components may also be referred to below, collectively or singularly, as a c iens a . Further, the lithographic apparatus 
may be of a type having two or more substrate tables (and/or two or more mask tables). In such "multiple stage" devices 
the additional tables may be used in paralie!, or preparatory steps may be carried out on one or more tables while one 
or more other tables are being used for exposures. Twin stage lithographic apparatus are described, for example, in 
US 5,969,441 and WO 98/40791 , incorporated herein by reference. 

[0006] The manufacture of ICs and other devices with such apparatus generally involves the replication of extremely 
fine sub-micron patterns, with an exceptionally high degree of positional accuracy. For this reason, it is essential to 
properly isolate various critical parts of the apparatus (such as the substrate table and mask table, for example) from 
spurious motion, vibration, mechanical shocks, etc. In general, this is achieved using such measures as carefully 
designed metrology frames, air-mounts, motional counterweights and dampers, which serve to isolate the apparatus' 
critical parts from most unwanted mechanical influences. However, such measures are not completely effective in 
eliminating a number of unwanted influences, such as, for example: 

1. vibrations In the substrate table due to leveling actions during exposure; 

2. vibrations caused by motion of reticle masking blades; 

3. resonance effects caused by the presence of air showers; 

4. vibrations in the substrate table caused by motion of the mask table, and vice versa, and; 

5. influence of air shower flow on the substrate table. 

[0007] Although these effects are relatively small, they become increasingly important as the need to produce ever- 
higher device resolutions increases, and they now form a substantial barrier to the viable realization of large-area ICs 
having critical dimensions of the order of 0.1 5 ujn and less. 

[0008] Accordingly, it has been proposed in EP-0 967 525-A to provide a control system for the substrate and mask 
tables of a lithographic apparatus in which errors in the position of the substrate table are compensated for by their 
inclusion as a feed-forward control in the mask table control loop. Specifically, the substrate table error is lowpass 
filtered, the output of the filter is then added to the mask table setpoint, and also twice differentiated, multiplied by the 
mask table mass and the resultant force applied to the mask table. This proposal is based on the realization that the 
absolute positions of the mask and substrate tables are less important than their relative position and allows the cor- 
rection of substrate table errors beyond the mask table bandwidth. However, this control system has performance 
limits, in part caused by the inevitable time delay in processing the substrate table error. 

[0009] It is an object of the present invention to provide an improved control system for the support structure for the 

patterning means, e.g. a mask, and substrate tables of a lithographic projection apparatus. 

[0010] This and other objects are achieved according to the Invention in a lithographic apparatus comprising: 

a radiation system for providing a projection beam of radiation; 

- a support structure for supporting patterning means, the patterning means serving to pattern the projection beam 
according to a desired pattern; 

a substrate table for holding a substrate; 

- a projection system for projecting the patterned beam onto a target portion of the substrate, 

- first driving means, for moving the support structure in a given reference direction substantially parallel to the plane 
of the patterning means; 

second driving means, for moving the substrate table parallel to the reference direction so as to be synchronous 
with the motion of the support structure; 

first measuring means, for determining the momentary position of the patterning means with respect to a fixed 
reference point; 

second measuring means, for determining the momentary position of the substrate table with respect to a fixed 
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reference point; and 

means for comparing the measured momentary position of the substrate table with a desired momentary position 
of the substrate table, for generating a position error signal in accordance with a difference between the said two 
positions; 

5 

characterized in that the apparatus further comprises: 

prediction means for generating a future position error signal based on one or more previous position error signals 
and for passing that future position error signal to correction means which serve to adjust the momentary position 
10 of the patterning means so as to compensate for such difference. 

[001 1J With the present invention, the drawbacks of the prior art are avoided and a improvement of 30 to 50% in the 
high-frequency synchronization error between the substrate and mask tables can be achieved. Preferably, the predic- 
tion is a sine-based prediction which can be readily be adapted to predict the second derivative of the position error 
15 signal thereby directly providing the basis for an acceleration error signal. Such a sine-based prediction can advanta- 
geously be implemented using one or more finite impulse response (FIR) filters. 

[0012] According to a further aspect of the invention there is provided a device manufacturing method comprising 
the steps of: 

20 „ providing a substrate that Is at least partially covered by a layer of radiation-sensitive material; 
providing a projection beam of radiation using a radiation system; 
using patterning means to endow the projection beam with a pattern in its cross-section; 

projecting the patterned beam of radiation onto a target portion of the layer of radiation-sensitive material, wherein 
said step of projecting comprises the sub-steps of: 

25 

moving a support structure supporting said patterning means in a given reference direction substantially par- 
allel to the plane of the patterning means; 

moving the substrate table parallel to the reference direction so as to be synchronized with the motion of the 
support structure; 

30 - determining the momentary position of the mask table with respect to a fixed reference point; 

determining the momentary position of the substrate table with respect to a fixed reference point; and 
- comparing the measured momentary position of the substrate table with a desired momentary position of the 
substrate table, for generating a position error signal in accordance with a difference between the said two 
positions; 

35 

characterized by the further step of predicting a future position error based on one or more previous position error 
signals and adjusting the momentary position of the mask table so as to compensate for such difference. 
[0013] Although specific reference may be made in this text to the use of the apparatus according to the invention 
in the manufacture of ICs, it should be explicitly understood that such an apparatus has many other possible applica- 
40 tions. For example, it may be employed in the manufacture of integrated optical systems, guidance and detection 
patterns for magnetic domain memories, liquid-crystal display panels, thin-film magnetic heads, etc. The skilled artisan 
will appreciate that, in the context of such alternative applications, any use of the terms "reticle 0 , "wafer" or B die" in this 
text should be considered as being replaced by the more general terms "mask", "substrate" and "target portion", re- 
spectively. 

45 [0014] In the present document, the terms "radiation" and "beam" are used to encompass all types of electromagnetic 
radiation, including ultraviolet radiation (e.g. with a wavelength of 365, 248, 193, 157 or 126 nm) and EUV (extreme 
ultra-violet radiation, e.g. having a wavelength in the range 5-20 nm), as well as particle beams, such as ion beams or 
electron beams. 

[001S] Embodiments of the invention will now be described, by way of example only, with reference to the accom- 
so panying schematic drawings in which: 

Figure 1 depicts a lithographic projection apparatus according to an embodiment of the invention; 
Figure 2 is a control theory circuit suitable for implementation in a first embodiment of the invention; 
Figures 3A and B are graphs of the magnitude and phase of the transfer functions of sine-based extrapolation 
55 predictors used in the first embodiment of the invention; 

Figure 4 is a control theory circuit suitable for implementation in a second embodiment of the invention; 
Figures 5A and B are graphs of the magnitudes and phases of the transfer functions of a control system according 
to the invention and two prior art control systems; 
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Figure 6 Es a graph of the feedthrough factors c! a control system according to the invention and two prior art controJ 
systems; 

Figures 7 A and B are graphs of the magnitude and phase of the transfer functions of sine-based double derivative 
predictors used in a second embodiment of the invention; 
s Figure 8 is a pole zero map of a double derivative predictor used in the second embodiment of the invention; 

Figures 9 is a graph used to illustrate polynomial extrapolation; and 

Figures 10A and B are graphs of the magnitude and phase of the transfer functions of polynomial extrapolation 
predictors used in a third embodiment of the invention. 

10 [0016] in the Figures, corresponding reference symbols indicate corresponding parts. 

Embodiment 1 

f0017] Figure 1 schematically depicts a lithographic projection apparatus according to a particular embodiment of 
*5 the invention. The apparatus comprises: 

a radiation system Ex, IL, for supplying a projection beam PB of radiation (e.g. U V radiation), which in this particular 
case also comprises a radiation source LA; 

a first object table (mask table) MT provided with a mask holder for holding a mask MA (e.g. a reticle) , and connected 
20 to first positioning means for accurately positioning the mask with respect to item PL; 

a second object table (substrate table) WT provided with a substrate holder for holding a substrate W {e.g. a resist- 
coated silicon wafer), and connected to second positioning means for accurately positioning the substrate with 
respect to item PL; 

a projection system ("lens") PL {e.g. a refractive lens system) for imaging an irradiated portion of the mask MA 
25 onto a target portion C {e.g. comprising one or more dies) of the substrate W. 

As here depicted, the apparatus is of a transmisstve type (i.e. has a transmissrve mask). However, in general, it may 
also be of a reflective type, for example (with a reflective mask). Alternatively, the apparatus may employ another kind 
of patterning means, such as a programmable mirror array of a type as referred to above. 

30 [0018] The source LA (e.g. an Hg lamp or excimer laser) produces a beam of radiation. This beam is fed into an 
illumination system (illuminator) IL, either directly or after having traversed conditioning means, such as a beam ex- 
pander Ex, for example. The illuminator IL may comprise adjusting means AM for setting the outer and/or inner radial 
extent (commonly referred to as a-outer and a-inner, respectively) of the intensity distribution in the beam. In addition, 
it will generally comprise various other components, such as an integrator IN and a condenser CO. In this way, the 

33 beam PB impinging on the mask MA has a desired intensity distribution in its cross-section. 

[0019] It should be noted with regard to Figure 1 that the source LA may be within the housing of the lithographic 
projection apparatus (as is often the case when the source LA is a mercury lamp, for example), but that it may also be 
remote from the lithographic projection apparatus, the radiation beam which it produces being led into the apparatus 
(e.g. with the aid of suitable directing mirrors); this latter scenario is often the case when the source LA is an excimer 

40 laser. The current invention and Claims encompass both of these scenarios. 

[0020] The beam PB subsequently intercepts the mask MA, which is held on a mask table MT. Having traversed the 
mask MA, the beam PB passes through the lens PL, which focuses the beam PB onto a target portion C of the substrate 
W. With the aid of the second positioning means (and interfero metric measuring means IF), the substrate table WT 
can be moved accurately, e.g. so as to position different target portions C in the path of the beam PB. Similarly, the 

43 first positioning means can be used to accurately position the mask MA with respect to the path of the beam PB, e.g. 
after mechanical retrieval of the mask MA from a mask library, or during a scan. In general, movement of the object 
tables MT, WT will be realized with the aid of a long-stroke module (coarse positioning) and a short-stroke module (fine 
positioning), which are not explicitly depicted in Figure 1 . However, in the case of a wafer stepper (as opposed to a 
step-and-scan apparatus) the mask table MT may just be connected to a short stroke actuator, or may be fixed. 

so [0021] The depicted apparatus can be used in two different modes: 

1 . In step mode, the mask table MT is kept essentially stationary, and an entire mask image is projected in one go 
(i.e. a single 'Hash 0 ) onto a target portion C. The substrate table WT is then shifted in the x and/or y directions so 
that a different target portion C can be irradiated by the beam PB; 
S3 2. In scan mode, essentially the same scenario applies, except that a given target portion C is not exposed in a 

single "flash 0 . Instead, the mask table MT is movable in a given direction (the so-called a scan direction", e.g. the 
y direction) with a speed v, so that the projection beam PB is caused to scan over a mask image; concurrently, 
the substrate table WT is simultaneously moved in the same or opposite direction at a speed V= Mv, in which M 
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is the m&qnTcatOTi of the iens PL (typica^y, M- 1/4 or 1/5). In this manner, a relatively large target portion C can 
be exposed, without having to compromise on resolution. 

[0022] Figure 2 depicts a control theory diagram of a control system used In an apparatus according to a first em- 
5 bodiment of the invention. The top loop represents the mask table (or reticle stage: R3), consisting of the mechanical 
transfer H RS and controller G RS . The bottom loop is the substrate table (or wafer stage: WS), consisting of the me- 
chanical transfer H ws and controller G^. The mask table receives a setpoint SPR which is four times the setpoint 
SPW of the substrate table, while its output only counts % in the relevant WS/RS error, reflecting the fact that the pattern 
image is projected by the projection system PL with a magnification M=U and that the mask table scans at four times 
10 the speed of the substrate. 

[0023] Via a predictor 1 0 desenbed further below, the substrate table error e^ s is added to the mask table setpoint 
SPR, and a WS acceleration error is generated. This error is combined with the output of the mask table controller 
G HS so that an extra force is generated which , when applied to the mask stage, should give it four times th© acceleration 
of the WS. When applied on its own, the mask table controller G RS would consider this force a disturbance, and would 
15 react and try to diminish the effect of the feedforward. By additionally feeding the predicted substrate table error to the 
mask table setpoint, this problem (s solved. Now, if the mask table reacts to the extra feedforward force as a transfer 
1/ms 2 , the mask stage error e RS remains zero and hence the mask stage controller is left "unaware" of any extra force 
injection. 

[0024] Details of the predictor 10 will now be explained with reference to Figures 5 to 7. The predictor 10 employs 
20 a prediction by sine-fit method which assumes a limited number of dominant frequencies in the signal to be predicted. 
Assume that the substrate table position error e V¥S consists of m sinusoids with unknown amplitude and phase, but 
with known frequencies: 



25 



y{t)= v ^-sin(2^rj+^cos(2^;-fj (i) 



[0025] Further, assume that n samples in the past are used to fit the data on. With two dominant frequencies / 1 and 
30 f 2t the following equation arises: 
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[0026] Hence, for n>4, an over-determined system arises with more equations than unknowns. Using a least-squares 
algorithm, a 1t a^ b u b 2 car\ be solved. For n=4, four equations with four unknowns arise, that can be solved analytically 
by inverting the above matrix. Hence, in an on-line prediction situation, every sample the above 4x4 matrix must be 
generated, inverted, and multiplied with the y-vector containing the last 4 samples of the data to be predicted. The 
resulting a and b coefficients can then be used to predict the value of y at the next sample f 0 : 



y{t 0 ) = a, sin(2n/ 1 f 0 ) + b A oos(2nf^t 0 ) + a z sin(2n/ 2 f 0 ) + b 2 cos(2n/ 2 f 0 ) 



(4) 



[0027] A computationally more efficient way for the case that the matrix is square (n=2m), lies in a better choice of 
the time scale. The time scale is shifted such that at the prediction target time instant, t is defined as 0. The time scale 
hence shifts each sample. The current sample y(t1) is inserted into the algorithm as if it belongs to a point on the time 
scale at f=-7s, y(t2) belongs to t=-2Ts, and so on. There is no reason to keep the time scale absolute, and in this case 
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it Is advantageous to shift ft each samp:© such that the £m© Instant of the predicted value coincides with t=0 on the 
time seals, in that case: 

1 . The A matrix becomes constant and can be constructed and inverted offline, 

2. Finding the predicted value only involves adding b1 and b2, because the time instant one sample in the future 
is at t=0. 

[0028] Finding the 1 -sample prediction then actually boils down to calculating a third-order FIR filter: 

y(/c+1) = (c 21 + c 41 ) y{k) + (c^ +c 42 ) y(k- 1) + 

( C 23 + C4s)yl k ' 2 ) + (^4 + c 44)V( k ' 3 ) (5) 

13 Cjj being the (i, j)^ coefficient of A" 1 . 

[0029] Figures 3A and B show the magnitude and phase respectively of the frequency response of this predictor for 
various values of n. At 107 and 185 Hz, the chosen prediction frequencies / 1 and / 2 , amplitude and phase are always 
correct (prediction is exact). At other frequencies the result is undefined, but for n=4 (minimum) the result is still good. 
[0030] it will be appreciated that the DC gain can be forced to 1 by adding a column vector containing 1 's at the right 

20 side of the matrix A, and adding one offset coefficient c to the vector containing a and b coefficients. The signal y is 
then supposed to include a DC component: 
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3<0=c+ £ tf £ s^2^]+^co^2^t] (6) 



n must then be 2m+1, minimum. 

[0031] Also, choosing more base sinusoids requires a larger /?, which generally leads to a less favorable amplitude/ 
phase behavior. Further, the prediction horizon can have any value and is not restricted to whole number of samples. 

30 This is achieved by shifting the time scale. 

[0032] The implementation of this is shown in Figure 2. The predictor receives the substrate table position error and 
predicts its value one sample in the future based on the chosen prediction frequencies. The predictor is implemented 
as a border FIR filter (two prediction frequencies + a DC component). The predictor output is refined in the frequency 
domain by one or more conventional filters depicted by the block 'H^ter'. The output of H nttcr is filtered by low-pass 

33 filter 1 4, with a gain of -4, and added to the mask table setpoint SPR. Also, the output of H Fnter is double-differentiated, 
multiplied by the mask table mass m, filtered by a low-pass filter 16 which has the same characteristics as low-pass 
filter 14, and added to the mask table controller output. 



Embodiment 2 

[0033] According to a second embodiment of the invention, which is shown in Figure 4 and is otherwise the same 
as the first embodiment, the predictor is used to predict the second derivative of y by evaluating: 



dt 2 



2=1. . .771 



Hence, the only difference is the determination of the FIR filter parameters. 

[0034] Special care must be taken for the lower frequencies. Figures 7A and B show the response of the double- 
derivative predictor when making use of the same base frequencies as in the first embodiment: 107 and 185 Hz. it can 
be seen that below 107 Hz, the magnitude becomes flat, while a +2 slope is required. To repair this behavior, a very 
55 low frequency of 1 rad/s is included in the base frequencies, forcing a proper response for lower frequencies as also 
shown in Figures 7A and B. Introducing a DC gain of 0 does not help here, because this can also be obtained by a +1 
slope. A low, nonzero extra base frequency is therefore required. The cost is, again, a larger high-frequency gain. 
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[0035] Rgure 8 shows the discrete potes and zeros c! the FIR filter that results from th!s design technique. A straight- 
forward double differentiator has two zeros in z=+1 and two poles in z=0. The sine-fit predictor creates three pole/zero 
pairs at strategic locations. Note that this filter has no continuous-time counterpart. 

[0036] The implementation of this is shown In Rgure 4. The substrate table position error is fed through the block 
' H Raer'« wnj ch refines the phase at specific frequencies present in the substrate table position error. The output of H RIter 
is filtered by low-pass filter 14, with a gain of -4, and added to the mask table setpolnt SPR. Also, the output of H Rlter 
is fed to the acceleration error predictor, which predicts the substrate table acceleration error in the future. The predictor 
is implemented as a 5 m -order FIR filter (three prediction frequencies, of which one is kept very low to force a correct 
low-frequency behaviour). The predictor output is multiplied by the mask table mass m, filtered by a iow-pass filter 1 6 
which has the same characteristics as low-pass filter 14, and added to the mask table controller output. 
[0037] Figures 5A, 5B and 6 illustrate the improvement obtained with the predictor of the present invention. Figures 
5A and B show the magnitude and phase of the transfer function y^/e^ for the prior art arrangement a) without optimal 
conventional filters (solid line), b) the prior art with optimal conventional filters (long dashed line) and c) the present 
embodiment (short dashed line). It will be seen that the transfer function of the present Invention using a predictor Is 
much closer to the idea! of magnitude = 1 in Figure 5A and 0 phase tag in Figure 5B for all relevant frequencies below 
300Hz. The feedthrough factor ewus /e vjs te shown in Figure 6. In this Figure, 0 is idea! whilst values below 1 denote 
improvement and values above 1 denote degradation, it will be seen that introduction of the prediction of the present 
invention improves the feedthrough factor at frequencies up to about 350 Hz compared to the prior art at the cost of 
some increase at higher frequencies, which are less important. 

[0038] The effect on the feedthrough mechanism is shown In Figures 5A, B and 6, as discussed above. At dominant 
frequencies in ews, the feedthrough factor has improved a great deal, at the cost of a larger high-frequency gain. 
[0039] Results for moving average (MA) and moving standard deviation (MSD) are shown in the table, which also 
includes the best-obtained results using lead-iag filters only. 





MAX [nm] 


MA-Y [nm] 


MSD-X[nm] 


MSD-Y [nm] 


©ws 


8.7 


20.2 


20.9 


31.3 


e^s, original 


2.2 


2.9 


11.6 


9.4 


eyvps, lead-lag filters 


1.9 


2.3 


8.1 


7.5 


©wrs* acc - predictor 


1.5 


1.9 


6.8 


6.3 
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[0040] In a third embodiment of the Invention, which is otherwise the same as the first embodiment, the sine-based 
predictor is replaced by a predictor based on a polynomial extrapolation of previous data points. 
[0041] On the last n samples a polynomial can be fitted (having a maximum order of (n-1)), that can be used for 
extrapolation to the next sample. A basic example is shown in Figure 9. Here, the last four samples are used to fit a 
straight line through, that is used to predict the next sample. 

[0042] Experiments showed that it is not useful to use more than the minimum required number of past datapoints. 
Using more points introduces an unfavorable phase lag. Hence, using n points In the past, using an (/> f)th-order 
polynomial is the best one can do. Figures 1 0A and B show the frequency response for various values of n. It is seen 
that the 3rd-order polynomial is flat up to around 300 Hz, above which a large gain occurs. The larger the polynomial 
order, the betterthe phase advantage, but also the larger the high-frequency gain. This is true in general: a large phase 
advantage is combined with unfavorable high-frequency gain. 

[0043] Whilst specific embodiments of the Invention have been described above, it will be appreciated that the in- 
vention may be practiced otherwise than as described. For example, the invention may be applied in reverse, I.e. with 
the position error of the patterning means (e.g. mask table) predicted and fed into the control of the substrate table. 
The description is not intended to limit the invention. 
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Claims 

1 . A lithographic projection apparatus comprising: 

a radiation system for providing a projection beam of radiation; 

a support structure for supporting patterning means, the patterning means serving to pattern the projection 
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beam according to a desired pattern; 
a substrate table for holding a substrate; 

a projection system for projecting the patterned beam onto a target portion of the substrate, 
first driving means, for moving the support structure in a given reference direction substantially parallel to the 
s plane of the patterning means; 

- second driving means, for moving the substrate table parallel to the reference direction so as to be synchronous 
with the motion of the support structure; 

first measuring means, for determining the momentary position of the patterning means with respect to a fixed 
reference point; 

10 - second measuring means, for determining the momentary position of the substrate table with respect to a 

fixed reference point; and 

means for comparing the measured momentary position of the substrate table with a desired momentary 
position of the substrate table, for generating a position error signal in accordance with a difference between 
the said two positions; 
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characterized In that the apparatus further comprises: 



prediction means for generating a future position error signal based on one or more previous position error 
signals and for passing that future position error signal to correction means which serve to adjust the momen- 
20 tary position of the patterning means so as to compensate for such difference. 

2. Apparatus according to claim 1 wherein said prediction means is adapted to predict a first or higher derivative of 
said position error signal. 

25 3. Apparatus according to claim 1 or 2 wherein said prediction means comprises a sine-based predictor operating 
on n previous samples of the position error signal. 

4. Apparatus according to claim 1 or 2 wherein said prediction means is adapted to effect a polynomial prediction 
based on n previous samples of the position error signal. 

30 

5. Apparatus according to claim 3 or 4 wherein said prediction means comprises a finite impulse response filter. 

6. Apparatus according to any one of claims 3 or 4 wherein n is the minimal value required to obtain a full and sufficient 
set of equation. 

35 

7. An apparatus according to any one of the preceding claims, wherein the support structure comprises a mask table 
for holding a mask. 

8. An apparatus according to any one of the preceding claims, wherein the radiation system comprises a radiation 
source. 

9. A lithographic projection apparatus comprising: 



a radiation system for providing a projection beam of radiation; 

a support structure for supporting patterning means, the patterning means serving to pattern the projection 
beam according to a desired pattern; 
a substrate table for holding a substrate; 

a projection system for projecting the patterned beam onto a target portion of the substrate, 

first driving means, for moving the support structure in a given reference direction substantially parallel to the 

plane of the patterning means; 

second driving means, for moving the substrate table parallel to the reference direction so as to be synchronous 
with the motion of the support structure; 

first measuring means, for determining the momentary position of the patterning means with respect to a fixed 
reference point; 

second measuring means, for determining the momentary position of the substrate table with respect to a 
fixed reference point; and 

means for comparing the measured momentary position of the patterning means with a desired momentary 
position of the patterning means, for generating a position error signal in accordance with a difference between 
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the said two positions; 

characterized In that the apparatus further comprises: 

s - prediction means for generating a future position error signal based on one or more previous position error 

signals and for passing that future position error signal to correction means which serve to adjust the momen- 
tary position of the substrate table so as to compensate for such difference. 

10. A device manufacturing method comprising the steps of: 

w 

providing a substrate that is at least partially covered by a layer of radiation-sensitive material; 
providing a projection beam of radiation using a radiation system; 
using patterning means to endow the projection beam with a pattern in its cross-section; 
projecting the patterned beam of radiation onto a target portion of the layer of radiation-sensitive materia!, 
is wherein said step of projecting comprises the sub-steps of: 

moving a support structure supporting said patterning means in a given reference direction substantially 
parallel to the plane of the patterning means; 

moving the substrate table parallel to the reference direction so as to be synchronized with the motion of 

20 the support structure; 

determining the momentary position of the mask table with respect to a fixed reference point; 
determining the momentary position of the substrate table with respect to a fixed reference point; and 
comparing the measured momentary position of the substrate table with a desired momentary position of 
the substrate table, for generating a position error signal in accordance with a difference between the said 

25 two positions; 

characterized by the further step of predicting a future position error based on one or more previous position error 
signals and adjusting the momentary position of the mask tabte so as to compensate for such difference. 

30 1 1 . A device manufactured according to the method of claim 1 0. 
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Fig. 3 
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